Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 148670 |
| Missing cells | 116710 |
| Missing cells (%) | 4.4% |
| Duplicate rows | 1 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 20.4 MiB |
| Average record size in memory | 144.0 B |
Variable types
| Categorical | 10 |
|---|---|
| Numeric | 8 |
| Dataset has 1 (< 0.1%) duplicate rows | Duplicates |
construction_type is highly imbalanced (99.7%) | Imbalance |
rate_of_interest has 36439 (24.5%) missing values | Missing |
upfront_charges has 39642 (26.7%) missing values | Missing |
property_value has 15098 (10.2%) missing values | Missing |
income has 9150 (6.2%) missing values | Missing |
ltv has 15098 (10.2%) missing values | Missing |
ltv is highly skewed (γ1 = 120.6153375) | Skewed |
upfront_charges has 20770 (14.0%) zeros | Zeros |
Reproduction
| Analysis started | 2024-06-08 23:08:05.829806 |
|---|---|
| Analysis finished | 2024-06-08 23:08:56.450696 |
| Duration | 50.62 seconds |
| Software version | ydata-profiling v4.8.3 |
| Download configuration | config.json |
gender
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| Male | |
|---|---|
| Joint | |
| Sex Not Available | |
| Female |
Length
| Max length | 17 |
|---|---|
| Median length | 6 |
| Mean length | 7.9382391 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1180178 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Sex Not Available |
|---|---|
| 2nd row | Male |
| 3rd row | Male |
| 4th row | Male |
| 5th row | Joint |
Common Values
| Value | Count | Frequency (%) |
| Male | 42346 | |
| Joint | 41399 | |
| Sex Not Available | 37659 | |
| Female | 27266 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| male | 42346 | |
| joint | 41399 | |
| sex | 37659 | |
| not | 37659 | |
| available | 37659 | |
| female | 27266 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 172196 | |
| l | 144930 | |
| a | 144930 | |
| o | 79058 | 6.7% |
| i | 79058 | 6.7% |
| t | 79058 | 6.7% |
| 75318 | 6.4% | |
| M | 42346 | 3.6% |
| J | 41399 | 3.5% |
| n | 41399 | 3.5% |
| Other values (8) | 280486 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 880872 | |
| Uppercase Letter | 223988 | 19.0% |
| Space Separator | 75318 | 6.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 172196 | |
| l | 144930 | |
| a | 144930 | |
| o | 79058 | |
| i | 79058 | |
| t | 79058 | |
| n | 41399 | 4.7% |
| b | 37659 | 4.3% |
| v | 37659 | 4.3% |
| x | 37659 | 4.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 42346 | |
| J | 41399 | |
| A | 37659 | |
| S | 37659 | |
| N | 37659 | |
| F | 27266 |
Space Separator
| Value | Count | Frequency (%) |
| 75318 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1104860 | |
| Common | 75318 | 6.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 172196 | |
| l | 144930 | |
| a | 144930 | |
| o | 79058 | 7.2% |
| i | 79058 | 7.2% |
| t | 79058 | 7.2% |
| M | 42346 | 3.8% |
| J | 41399 | 3.7% |
| n | 41399 | 3.7% |
| A | 37659 | 3.4% |
| Other values (7) | 242827 |
Common
| Value | Count | Frequency (%) |
| 75318 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1180178 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 172196 | |
| l | 144930 | |
| a | 144930 | |
| o | 79058 | 6.7% |
| i | 79058 | 6.7% |
| t | 79058 | 6.7% |
| 75318 | 6.4% | |
| M | 42346 | 3.6% |
| J | 41399 | 3.5% |
| n | 41399 | 3.5% |
| Other values (8) | 280486 |
approv_in_adv
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 908 |
| Missing (%) | 0.6% |
| Memory size | 1.1 MiB |
| nopre | |
|---|---|
| pre |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.6867801 |
| Min length | 3 |
Characters and Unicode
| Total characters | 692528 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | nopre |
|---|---|
| 2nd row | nopre |
| 3rd row | pre |
| 4th row | nopre |
| 5th row | pre |
Common Values
| Value | Count | Frequency (%) |
| nopre | 124621 | |
| pre | 23141 | 15.6% |
| (Missing) | 908 | 0.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| nopre | 124621 | |
| pre | 23141 | 15.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| p | 147762 | |
| r | 147762 | |
| e | 147762 | |
| n | 124621 | |
| o | 124621 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 692528 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| p | 147762 | |
| r | 147762 | |
| e | 147762 | |
| n | 124621 | |
| o | 124621 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 692528 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| p | 147762 | |
| r | 147762 | |
| e | 147762 | |
| n | 124621 | |
| o | 124621 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 692528 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| p | 147762 | |
| r | 147762 | |
| e | 147762 | |
| n | 124621 | |
| o | 124621 |
loan_type
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| type1 | |
|---|---|
| type2 | |
| type3 |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 743350 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | type1 |
|---|---|
| 2nd row | type2 |
| 3rd row | type1 |
| 4th row | type1 |
| 5th row | type1 |
Common Values
| Value | Count | Frequency (%) |
| type1 | 113173 | |
| type2 | 20762 | 14.0% |
| type3 | 14735 | 9.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| type1 | 113173 | |
| type2 | 20762 | 14.0% |
| type3 | 14735 | 9.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 148670 | |
| y | 148670 | |
| p | 148670 | |
| e | 148670 | |
| 1 | 113173 | |
| 2 | 20762 | 2.8% |
| 3 | 14735 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 594680 | |
| Decimal Number | 148670 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 148670 | |
| y | 148670 | |
| p | 148670 | |
| e | 148670 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 113173 | |
| 2 | 20762 | 14.0% |
| 3 | 14735 | 9.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 594680 | |
| Common | 148670 | 20.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 148670 | |
| y | 148670 | |
| p | 148670 | |
| e | 148670 |
Common
| Value | Count | Frequency (%) |
| 1 | 113173 | |
| 2 | 20762 | 14.0% |
| 3 | 14735 | 9.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 743350 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 148670 | |
| y | 148670 | |
| p | 148670 | |
| e | 148670 | |
| 1 | 113173 | |
| 2 | 20762 | 2.8% |
| 3 | 14735 | 2.0% |
loan_purpose
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 134 |
| Missing (%) | 0.1% |
| Memory size | 1.1 MiB |
| p3 | |
|---|---|
| p4 | |
| p1 | |
| p2 | 3274 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 297072 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | p1 |
|---|---|
| 2nd row | p1 |
| 3rd row | p1 |
| 4th row | p4 |
| 5th row | p1 |
Common Values
| Value | Count | Frequency (%) |
| p3 | 55934 | |
| p4 | 54799 | |
| p1 | 34529 | |
| p2 | 3274 | 2.2% |
| (Missing) | 134 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| p3 | 55934 | |
| p4 | 54799 | |
| p1 | 34529 | |
| p2 | 3274 | 2.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| p | 148536 | |
| 3 | 55934 | 18.8% |
| 4 | 54799 | 18.4% |
| 1 | 34529 | 11.6% |
| 2 | 3274 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 148536 | |
| Decimal Number | 148536 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 55934 | |
| 4 | 54799 | |
| 1 | 34529 | |
| 2 | 3274 | 2.2% |
Lowercase Letter
| Value | Count | Frequency (%) |
| p | 148536 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 148536 | |
| Common | 148536 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 55934 | |
| 4 | 54799 | |
| 1 | 34529 | |
| 2 | 3274 | 2.2% |
Latin
| Value | Count | Frequency (%) |
| p | 148536 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 297072 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| p | 148536 | |
| 3 | 55934 | 18.8% |
| 4 | 54799 | 18.4% |
| 1 | 34529 | 11.6% |
| 2 | 3274 | 1.1% |
business_or_commercial
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| nob/c | |
|---|---|
| b/c |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.7206968 |
| Min length | 3 |
Characters and Unicode
| Total characters | 701826 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | nob/c |
|---|---|
| 2nd row | b/c |
| 3rd row | nob/c |
| 4th row | nob/c |
| 5th row | nob/c |
Common Values
| Value | Count | Frequency (%) |
| nob/c | 127908 | |
| b/c | 20762 | 14.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| nob/c | 127908 | |
| b/c | 20762 | 14.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| b | 148670 | |
| / | 148670 | |
| c | 148670 | |
| n | 127908 | |
| o | 127908 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 553156 | |
| Other Punctuation | 148670 | 21.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| b | 148670 | |
| c | 148670 | |
| n | 127908 | |
| o | 127908 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 148670 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 553156 | |
| Common | 148670 | 21.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| b | 148670 | |
| c | 148670 | |
| n | 127908 | |
| o | 127908 |
Common
| Value | Count | Frequency (%) |
| / | 148670 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 701826 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| b | 148670 | |
| / | 148670 | |
| c | 148670 | |
| n | 127908 | |
| o | 127908 |
loan_amount
Real number (ℝ)
| Distinct | 211 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 331117.74 |
| Minimum | 16500 |
|---|---|
| Maximum | 3576500 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 16500 |
|---|---|
| 5-th percentile | 106500 |
| Q1 | 196500 |
| median | 296500 |
| Q3 | 436500 |
| 95-th percentile | 656500 |
| Maximum | 3576500 |
| Range | 3560000 |
| Interquartile range (IQR) | 240000 |
Descriptive statistics
| Standard deviation | 183909.31 |
|---|---|
| Coefficient of variation (CV) | 0.55541968 |
| Kurtosis | 9.1277753 |
| Mean | 331117.74 |
| Median Absolute Deviation (MAD) | 120000 |
| Skewness | 1.6669981 |
| Sum | 4.9227275 × 1010 |
| Variance | 3.3822634 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 206500 | 4610 | 3.1% |
| 256500 | 4079 | 2.7% |
| 156500 | 3967 | 2.7% |
| 226500 | 3944 | 2.7% |
| 486500 | 3819 | 2.6% |
| 306500 | 3691 | 2.5% |
| 246500 | 3669 | 2.5% |
| 216500 | 3649 | 2.5% |
| 236500 | 3553 | 2.4% |
| 266500 | 3543 | 2.4% |
| Other values (201) | 110146 |
| Value | Count | Frequency (%) |
| 16500 | 3 | < 0.1% |
| 26500 | 27 | < 0.1% |
| 36500 | 119 | 0.1% |
| 46500 | 212 | 0.1% |
| 56500 | 810 | 0.5% |
| 66500 | 859 | 0.6% |
| 76500 | 1701 | |
| 86500 | 1605 | |
| 96500 | 1484 | |
| 106500 | 3210 |
| Value | Count | Frequency (%) |
| 3576500 | 1 | < 0.1% |
| 3346500 | 1 | < 0.1% |
| 3006500 | 4 | |
| 2986500 | 1 | < 0.1% |
| 2926500 | 1 | < 0.1% |
| 2706500 | 1 | < 0.1% |
| 2626500 | 1 | < 0.1% |
| 2606500 | 1 | < 0.1% |
| 2596500 | 1 | < 0.1% |
| 2506500 | 2 |
rate_of_interest
Real number (ℝ)
MISSING 
| Distinct | 131 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 36439 |
| Missing (%) | 24.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.0454758 |
| Minimum | 0 |
|---|---|
| Maximum | 8 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3.125 |
| Q1 | 3.625 |
| median | 3.99 |
| Q3 | 4.375 |
| 95-th percentile | 4.99 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 0.75 |
Descriptive statistics
| Standard deviation | 0.56139119 |
|---|---|
| Coefficient of variation (CV) | 0.13877013 |
| Kurtosis | 0.34456404 |
| Mean | 4.0454758 |
| Median Absolute Deviation (MAD) | 0.365 |
| Skewness | 0.38840603 |
| Sum | 454027.8 |
| Variance | 0.31516007 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3.99 | 14455 | 9.7% |
| 3.625 | 8800 | 5.9% |
| 3.875 | 8592 | 5.8% |
| 3.75 | 8474 | 5.7% |
| 3.5 | 6866 | 4.6% |
| 4.5 | 6809 | 4.6% |
| 4.375 | 6482 | 4.4% |
| 4.25 | 6045 | 4.1% |
| 4.125 | 5797 | 3.9% |
| 4.75 | 4875 | 3.3% |
| Other values (121) | 35036 | |
| (Missing) | 36439 |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 2.125 | 1 | < 0.1% |
| 2.25 | 4 | < 0.1% |
| 2.375 | 2 | < 0.1% |
| 2.475 | 2 | < 0.1% |
| 2.5 | 21 | |
| 2.575 | 1 | < 0.1% |
| 2.6 | 3 | < 0.1% |
| 2.625 | 25 | |
| 2.65 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 8 | 1 | < 0.1% |
| 7.75 | 1 | < 0.1% |
| 7.5 | 2 | < 0.1% |
| 7.375 | 1 | < 0.1% |
| 7.125 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 6.875 | 1 | < 0.1% |
| 6.75 | 5 | |
| 6.5 | 3 | |
| 6.375 | 1 | < 0.1% |
upfront_charges
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 58271 |
|---|---|
| Distinct (%) | 53.4% |
| Missing | 39642 |
| Missing (%) | 26.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3224.9961 |
| Minimum | 0 |
|---|---|
| Maximum | 60000 |
| Zeros | 20770 |
| Zeros (%) | 14.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 581.49 |
| median | 2596.45 |
| Q3 | 4812.5 |
| 95-th percentile | 9272.6885 |
| Maximum | 60000 |
| Range | 60000 |
| Interquartile range (IQR) | 4231.01 |
Descriptive statistics
| Standard deviation | 3251.1215 |
|---|---|
| Coefficient of variation (CV) | 1.0081009 |
| Kurtosis | 6.3685863 |
| Mean | 3224.9961 |
| Median Absolute Deviation (MAD) | 2108.66 |
| Skewness | 1.7540757 |
| Sum | 3.5161488 × 108 |
| Variance | 10569791 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 20770 | 14.0% |
| 1250 | 1184 | 0.8% |
| 1150 | 892 | 0.6% |
| 795 | 487 | 0.3% |
| 295 | 403 | 0.3% |
| 950 | 192 | 0.1% |
| 3000 | 173 | 0.1% |
| 995 | 151 | 0.1% |
| 4000 | 149 | 0.1% |
| 5000 | 147 | 0.1% |
| Other values (58261) | 84480 | |
| (Missing) | 39642 |
| Value | Count | Frequency (%) |
| 0 | 20770 | |
| 0.03 | 1 | < 0.1% |
| 0.06 | 1 | < 0.1% |
| 0.35 | 1 | < 0.1% |
| 0.6 | 1 | < 0.1% |
| 0.72 | 1 | < 0.1% |
| 0.75 | 1 | < 0.1% |
| 0.92 | 1 | < 0.1% |
| 1 | 12 | < 0.1% |
| 1.15 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 60000 | 1 | |
| 53485.78 | 1 | |
| 38437.5 | 1 | |
| 38375 | 1 | |
| 37604.38 | 1 | |
| 35192.5 | 1 | |
| 33268 | 1 | |
| 32850 | 1 | |
| 32825.25 | 1 | |
| 32647 | 1 |
term
Real number (ℝ)
| Distinct | 26 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 41 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 335.13658 |
| Minimum | 96 |
|---|---|
| Maximum | 360 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 96 |
|---|---|
| 5-th percentile | 180 |
| Q1 | 360 |
| median | 360 |
| Q3 | 360 |
| 95-th percentile | 360 |
| Maximum | 360 |
| Range | 264 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 58.409084 |
|---|---|
| Coefficient of variation (CV) | 0.17428442 |
| Kurtosis | 3.1732363 |
| Mean | 335.13658 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -2.1748218 |
| Sum | 49811015 |
| Variance | 3411.621 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 360 | 121685 | |
| 180 | 12981 | 8.7% |
| 240 | 5859 | 3.9% |
| 300 | 2822 | 1.9% |
| 324 | 2766 | 1.9% |
| 120 | 510 | 0.3% |
| 144 | 263 | 0.2% |
| 348 | 260 | 0.2% |
| 336 | 213 | 0.1% |
| 96 | 194 | 0.1% |
| Other values (16) | 1076 | 0.7% |
| Value | Count | Frequency (%) |
| 96 | 194 | 0.1% |
| 108 | 33 | < 0.1% |
| 120 | 510 | 0.3% |
| 132 | 93 | 0.1% |
| 144 | 263 | 0.2% |
| 156 | 174 | 0.1% |
| 165 | 1 | < 0.1% |
| 168 | 82 | 0.1% |
| 180 | 12981 | |
| 192 | 17 | < 0.1% |
| Value | Count | Frequency (%) |
| 360 | 121685 | |
| 348 | 260 | 0.2% |
| 336 | 213 | 0.1% |
| 324 | 2766 | 1.9% |
| 322 | 1 | < 0.1% |
| 312 | 185 | 0.1% |
| 300 | 2822 | 1.9% |
| 288 | 90 | 0.1% |
| 280 | 1 | < 0.1% |
| 276 | 100 | 0.1% |
property_value
Real number (ℝ)
MISSING 
| Distinct | 385 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 15098 |
| Missing (%) | 10.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 497893.47 |
| Minimum | 8000 |
|---|---|
| Maximum | 16508000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 8000 |
|---|---|
| 5-th percentile | 148000 |
| Q1 | 268000 |
| median | 418000 |
| Q3 | 628000 |
| 95-th percentile | 1058000 |
| Maximum | 16508000 |
| Range | 16500000 |
| Interquartile range (IQR) | 360000 |
Descriptive statistics
| Standard deviation | 359935.32 |
|---|---|
| Coefficient of variation (CV) | 0.72291633 |
| Kurtosis | 73.221196 |
| Mean | 497893.47 |
| Median Absolute Deviation (MAD) | 170000 |
| Skewness | 4.5862758 |
| Sum | 6.6504626 × 1010 |
| Variance | 1.2955343 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 308000 | 2792 | 1.9% |
| 258000 | 2763 | 1.9% |
| 358000 | 2679 | 1.8% |
| 408000 | 2537 | 1.7% |
| 328000 | 2524 | 1.7% |
| 278000 | 2513 | 1.7% |
| 268000 | 2497 | 1.7% |
| 228000 | 2493 | 1.7% |
| 238000 | 2408 | 1.6% |
| 288000 | 2398 | 1.6% |
| Other values (375) | 107968 | |
| (Missing) | 15098 | 10.2% |
| Value | Count | Frequency (%) |
| 8000 | 6 | < 0.1% |
| 18000 | 1 | < 0.1% |
| 28000 | 9 | < 0.1% |
| 38000 | 35 | < 0.1% |
| 48000 | 71 | < 0.1% |
| 58000 | 141 | 0.1% |
| 68000 | 271 | |
| 78000 | 387 | |
| 88000 | 568 | |
| 98000 | 556 |
| Value | Count | Frequency (%) |
| 16508000 | 1 | |
| 12008000 | 1 | |
| 11008000 | 1 | |
| 10008000 | 1 | |
| 9268000 | 1 | |
| 8508000 | 1 | |
| 7608000 | 1 | |
| 6908000 | 1 | |
| 6508000 | 1 | |
| 6408000 | 1 |
construction_type
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| sb | |
|---|---|
| mh | 33 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 297340 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | sb |
|---|---|
| 2nd row | sb |
| 3rd row | sb |
| 4th row | sb |
| 5th row | sb |
Common Values
| Value | Count | Frequency (%) |
| sb | 148637 | |
| mh | 33 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| sb | 148637 | |
| mh | 33 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 148637 | |
| b | 148637 | |
| m | 33 | < 0.1% |
| h | 33 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 297340 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 148637 | |
| b | 148637 | |
| m | 33 | < 0.1% |
| h | 33 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 297340 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 148637 | |
| b | 148637 | |
| m | 33 | < 0.1% |
| h | 33 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 297340 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 148637 | |
| b | 148637 | |
| m | 33 | < 0.1% |
| h | 33 | < 0.1% |
income
Real number (ℝ)
MISSING 
| Distinct | 1001 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 9150 |
| Missing (%) | 6.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6957.3389 |
| Minimum | 0 |
|---|---|
| Maximum | 578580 |
| Zeros | 1260 |
| Zeros (%) | 0.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1920 |
| Q1 | 3720 |
| median | 5760 |
| Q3 | 8520 |
| 95-th percentile | 15420 |
| Maximum | 578580 |
| Range | 578580 |
| Interquartile range (IQR) | 4800 |
Descriptive statistics
| Standard deviation | 6496.5864 |
|---|---|
| Coefficient of variation (CV) | 0.93377461 |
| Kurtosis | 885.29246 |
| Mean | 6957.3389 |
| Median Absolute Deviation (MAD) | 2280 |
| Skewness | 17.307695 |
| Sum | 9.7068792 × 108 |
| Variance | 42205635 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1260 | 0.8% |
| 3600 | 1250 | 0.8% |
| 4200 | 1243 | 0.8% |
| 4800 | 1191 | 0.8% |
| 3120 | 1168 | 0.8% |
| 3720 | 1161 | 0.8% |
| 3900 | 1159 | 0.8% |
| 5400 | 1152 | 0.8% |
| 3300 | 1144 | 0.8% |
| 4500 | 1139 | 0.8% |
| Other values (991) | 127653 | |
| (Missing) | 9150 | 6.2% |
| Value | Count | Frequency (%) |
| 0 | 1260 | |
| 60 | 5 | < 0.1% |
| 120 | 12 | < 0.1% |
| 180 | 12 | < 0.1% |
| 240 | 15 | < 0.1% |
| 300 | 18 | < 0.1% |
| 360 | 11 | < 0.1% |
| 420 | 15 | < 0.1% |
| 480 | 11 | < 0.1% |
| 540 | 17 | < 0.1% |
| Value | Count | Frequency (%) |
| 578580 | 1 | |
| 377220 | 1 | |
| 374400 | 1 | |
| 335880 | 2 | |
| 329460 | 1 | |
| 322860 | 1 | |
| 312000 | 1 | |
| 240000 | 1 | |
| 235980 | 1 | |
| 198060 | 1 |
credit_type
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| CIB | |
|---|---|
| CRIF | |
| EXP | |
| EQUI |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.3981906 |
| Min length | 3 |
Characters and Unicode
| Total characters | 505209 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EXP |
|---|---|
| 2nd row | EQUI |
| 3rd row | EXP |
| 4th row | EXP |
| 5th row | CRIF |
Common Values
| Value | Count | Frequency (%) |
| CIB | 48152 | |
| CRIF | 43901 | |
| EXP | 41319 | |
| EQUI | 15298 | 10.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| cib | 48152 | |
| crif | 43901 | |
| exp | 41319 | |
| equi | 15298 | 10.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 107351 | |
| C | 92053 | |
| E | 56617 | |
| B | 48152 | |
| R | 43901 | |
| F | 43901 | |
| X | 41319 | 8.2% |
| P | 41319 | 8.2% |
| Q | 15298 | 3.0% |
| U | 15298 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 505209 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 107351 | |
| C | 92053 | |
| E | 56617 | |
| B | 48152 | |
| R | 43901 | |
| F | 43901 | |
| X | 41319 | 8.2% |
| P | 41319 | 8.2% |
| Q | 15298 | 3.0% |
| U | 15298 | 3.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 505209 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 107351 | |
| C | 92053 | |
| E | 56617 | |
| B | 48152 | |
| R | 43901 | |
| F | 43901 | |
| X | 41319 | 8.2% |
| P | 41319 | 8.2% |
| Q | 15298 | 3.0% |
| U | 15298 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 505209 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 107351 | |
| C | 92053 | |
| E | 56617 | |
| B | 48152 | |
| R | 43901 | |
| F | 43901 | |
| X | 41319 | 8.2% |
| P | 41319 | 8.2% |
| Q | 15298 | 3.0% |
| U | 15298 | 3.0% |
credit_score
Real number (ℝ)
| Distinct | 401 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 699.7891 |
| Minimum | 500 |
|---|---|
| Maximum | 900 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 500 |
|---|---|
| 5-th percentile | 519 |
| Q1 | 599 |
| median | 699 |
| Q3 | 800 |
| 95-th percentile | 881 |
| Maximum | 900 |
| Range | 400 |
| Interquartile range (IQR) | 201 |
Descriptive statistics
| Standard deviation | 115.87586 |
|---|---|
| Coefficient of variation (CV) | 0.16558683 |
| Kurtosis | -1.2026494 |
| Mean | 699.7891 |
| Median Absolute Deviation (MAD) | 100 |
| Skewness | 0.004766757 |
| Sum | 1.0403765 × 108 |
| Variance | 13427.214 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 763 | 415 | 0.3% |
| 867 | 413 | 0.3% |
| 639 | 411 | 0.3% |
| 581 | 408 | 0.3% |
| 554 | 407 | 0.3% |
| 519 | 406 | 0.3% |
| 737 | 406 | 0.3% |
| 890 | 406 | 0.3% |
| 687 | 405 | 0.3% |
| 617 | 405 | 0.3% |
| Other values (391) | 144588 |
| Value | Count | Frequency (%) |
| 500 | 357 | |
| 501 | 357 | |
| 502 | 346 | |
| 503 | 383 | |
| 504 | 392 | |
| 505 | 379 | |
| 506 | 380 | |
| 507 | 386 | |
| 508 | 400 | |
| 509 | 348 |
| Value | Count | Frequency (%) |
| 900 | 393 | |
| 899 | 352 | |
| 898 | 370 | |
| 897 | 383 | |
| 896 | 391 | |
| 895 | 371 | |
| 894 | 361 | |
| 893 | 348 | |
| 892 | 366 | |
| 891 | 376 |
age
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 200 |
| Missing (%) | 0.1% |
| Memory size | 1.1 MiB |
| 45-54 | |
|---|---|
| 35-44 | |
| 55-64 | |
| 65-74 | |
| 25-34 | |
| Other values (2) |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.8853371 |
| Min length | 3 |
Characters and Unicode
| Total characters | 725326 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 25-34 |
|---|---|
| 2nd row | 55-64 |
| 3rd row | 35-44 |
| 4th row | 45-54 |
| 5th row | 25-34 |
Common Values
| Value | Count | Frequency (%) |
| 45-54 | 34720 | |
| 35-44 | 32818 | |
| 55-64 | 32534 | |
| 65-74 | 20744 | |
| 25-34 | 19142 | |
| >74 | 7175 | 4.8% |
| <25 | 1337 | 0.9% |
| (Missing) | 200 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 45-54 | 34720 | |
| 35-44 | 32818 | |
| 55-64 | 32534 | |
| 65-74 | 20744 | |
| 25-34 | 19142 | |
| 74 | 7175 | 4.8% |
| 25 | 1337 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 214671 | |
| 5 | 208549 | |
| - | 139958 | |
| 6 | 53278 | 7.3% |
| 3 | 51960 | 7.2% |
| 7 | 27919 | 3.8% |
| 2 | 20479 | 2.8% |
| > | 7175 | 1.0% |
| < | 1337 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 576856 | |
| Dash Punctuation | 139958 | 19.3% |
| Math Symbol | 8512 | 1.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 214671 | |
| 5 | 208549 | |
| 6 | 53278 | 9.2% |
| 3 | 51960 | 9.0% |
| 7 | 27919 | 4.8% |
| 2 | 20479 | 3.6% |
Math Symbol
| Value | Count | Frequency (%) |
| > | 7175 | |
| < | 1337 | 15.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 139958 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 725326 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 214671 | |
| 5 | 208549 | |
| - | 139958 | |
| 6 | 53278 | 7.3% |
| 3 | 51960 | 7.2% |
| 7 | 27919 | 3.8% |
| 2 | 20479 | 2.8% |
| > | 7175 | 1.0% |
| < | 1337 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 725326 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 214671 | |
| 5 | 208549 | |
| - | 139958 | |
| 6 | 53278 | 7.3% |
| 3 | 51960 | 7.2% |
| 7 | 27919 | 3.8% |
| 2 | 20479 | 2.8% |
| > | 7175 | 1.0% |
| < | 1337 | 0.2% |
ltv
Real number (ℝ)
MISSING  SKEWED 
| Distinct | 8484 |
|---|---|
| Distinct (%) | 6.4% |
| Missing | 15098 |
| Missing (%) | 10.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 72.746457 |
| Minimum | 0.9674782 |
|---|---|
| Maximum | 7831.25 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 0.9674782 |
|---|---|
| 5-th percentile | 36.350575 |
| Q1 | 60.47486 |
| median | 75.13587 |
| Q3 | 86.184211 |
| 95-th percentile | 98.728814 |
| Maximum | 7831.25 |
| Range | 7830.2825 |
| Interquartile range (IQR) | 25.70935 |
Descriptive statistics
| Standard deviation | 39.967603 |
|---|---|
| Coefficient of variation (CV) | 0.54940961 |
| Kurtosis | 19979.045 |
| Mean | 72.746457 |
| Median Absolute Deviation (MAD) | 12.514733 |
| Skewness | 120.61534 |
| Sum | 9716889.8 |
| Variance | 1597.4093 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 81.25 | 530 | 0.4% |
| 91.66666667 | 499 | 0.3% |
| 80.03875969 | 380 | 0.3% |
| 80.03246753 | 328 | 0.2% |
| 94.95614035 | 322 | 0.2% |
| 78.84615385 | 317 | 0.2% |
| 78.64583333 | 310 | 0.2% |
| 79.04040404 | 309 | 0.2% |
| 80.06329114 | 309 | 0.2% |
| 95.16806723 | 306 | 0.2% |
| Other values (8474) | 129962 | |
| (Missing) | 15098 | 10.2% |
| Value | Count | Frequency (%) |
| 0.967478198 | 1 | |
| 2.072942643 | 1 | |
| 2.767587397 | 1 | |
| 2.81374502 | 1 | |
| 2.856420627 | 1 | |
| 2.992584746 | 1 | |
| 3.083554377 | 1 | |
| 3.125 | 1 | |
| 3.74668435 | 1 | |
| 3.875171468 | 1 |
| Value | Count | Frequency (%) |
| 7831.25 | 1 | |
| 6706.25 | 1 | |
| 5206.25 | 1 | |
| 4706.25 | 1 | |
| 2956.25 | 1 | |
| 2331.25 | 1 | |
| 263.5416667 | 1 | |
| 237.5 | 2 | |
| 220.3629032 | 1 | |
| 201.7857143 | 1 |
region
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| North | |
|---|---|
| south | |
| central | |
| North-East | 1235 |
Length
| Max length | 10 |
|---|---|
| Median length | 5 |
| Mean length | 5.1585323 |
| Min length | 5 |
Characters and Unicode
| Total characters | 766919 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | south |
|---|---|
| 2nd row | North |
| 3rd row | south |
| 4th row | North |
| 5th row | North |
Common Values
| Value | Count | Frequency (%) |
| North | 74722 | |
| south | 64016 | |
| central | 8697 | 5.8% |
| North-East | 1235 | 0.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| north | 74722 | |
| south | 64016 | |
| central | 8697 | 5.8% |
| north-east | 1235 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 149905 | |
| o | 139973 | |
| h | 139973 | |
| r | 84654 | |
| N | 75957 | |
| s | 65251 | |
| u | 64016 | |
| a | 9932 | 1.3% |
| c | 8697 | 1.1% |
| e | 8697 | 1.1% |
| Other values (4) | 19864 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 688492 | |
| Uppercase Letter | 77192 | 10.1% |
| Dash Punctuation | 1235 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 149905 | |
| o | 139973 | |
| h | 139973 | |
| r | 84654 | |
| s | 65251 | |
| u | 64016 | |
| a | 9932 | 1.4% |
| c | 8697 | 1.3% |
| e | 8697 | 1.3% |
| n | 8697 | 1.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 75957 | |
| E | 1235 | 1.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1235 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 765684 | |
| Common | 1235 | 0.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 149905 | |
| o | 139973 | |
| h | 139973 | |
| r | 84654 | |
| N | 75957 | |
| s | 65251 | |
| u | 64016 | |
| a | 9932 | 1.3% |
| c | 8697 | 1.1% |
| e | 8697 | 1.1% |
| Other values (3) | 18629 | 2.4% |
Common
| Value | Count | Frequency (%) |
| - | 1235 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 766919 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 149905 | |
| o | 139973 | |
| h | 139973 | |
| r | 84654 | |
| N | 75957 | |
| s | 65251 | |
| u | 64016 | |
| a | 9932 | 1.3% |
| c | 8697 | 1.1% |
| e | 8697 | 1.1% |
| Other values (4) | 19864 | 2.6% |
status
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 148670 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 112031 | |
| 1 | 36639 | 24.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 112031 | |
| 1 | 36639 | 24.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 112031 | |
| 1 | 36639 | 24.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 148670 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 112031 | |
| 1 | 36639 | 24.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 148670 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 112031 | |
| 1 | 36639 | 24.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 148670 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 112031 | |
| 1 | 36639 | 24.6% |
| gender | approv_in_adv | loan_type | loan_purpose | business_or_commercial | loan_amount | rate_of_interest | upfront_charges | term | property_value | construction_type | income | credit_type | credit_score | age | ltv | region | status | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Sex Not Available | nopre | type1 | p1 | nob/c | 116500 | NaN | NaN | 360.0 | 118000.0 | sb | 1740.0 | EXP | 758 | 25-34 | 98.728814 | south | 1 |
| 1 | Male | nopre | type2 | p1 | b/c | 206500 | NaN | NaN | 360.0 | NaN | sb | 4980.0 | EQUI | 552 | 55-64 | NaN | North | 1 |
| 2 | Male | pre | type1 | p1 | nob/c | 406500 | 4.560 | 595.00 | 360.0 | 508000.0 | sb | 9480.0 | EXP | 834 | 35-44 | 80.019685 | south | 0 |
| 3 | Male | nopre | type1 | p4 | nob/c | 456500 | 4.250 | NaN | 360.0 | 658000.0 | sb | 11880.0 | EXP | 587 | 45-54 | 69.376900 | North | 0 |
| 4 | Joint | pre | type1 | p1 | nob/c | 696500 | 4.000 | 0.00 | 360.0 | 758000.0 | sb | 10440.0 | CRIF | 602 | 25-34 | 91.886544 | North | 0 |
| 5 | Joint | pre | type1 | p1 | nob/c | 706500 | 3.990 | 370.00 | 360.0 | 1008000.0 | sb | 10080.0 | EXP | 864 | 35-44 | 70.089286 | North | 0 |
| 6 | Joint | pre | type1 | p3 | nob/c | 346500 | 4.500 | 5120.00 | 360.0 | 438000.0 | sb | 5040.0 | EXP | 860 | 55-64 | 79.109589 | North | 0 |
| 7 | Female | nopre | type1 | p4 | nob/c | 266500 | 4.125 | 5609.88 | 360.0 | 308000.0 | sb | 3780.0 | CIB | 863 | 55-64 | 86.525974 | North | 0 |
| 8 | Joint | nopre | type1 | p3 | nob/c | 376500 | 4.875 | 1150.00 | 360.0 | 478000.0 | sb | 5580.0 | CIB | 580 | 55-64 | 78.765690 | central | 0 |
| 9 | Sex Not Available | nopre | type3 | p3 | nob/c | 436500 | 3.490 | 2316.50 | 360.0 | 688000.0 | sb | 6720.0 | CIB | 788 | 55-64 | 63.444767 | south | 0 |
| gender | approv_in_adv | loan_type | loan_purpose | business_or_commercial | loan_amount | rate_of_interest | upfront_charges | term | property_value | construction_type | income | credit_type | credit_score | age | ltv | region | status | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 148660 | Female | nopre | type1 | p4 | nob/c | 366500 | 3.875 | 3643.16 | 360.0 | 658000.0 | sb | 7200.0 | CIB | 851 | 45-54 | 55.699088 | North | 0 |
| 148661 | Sex Not Available | nopre | type2 | p4 | b/c | 346500 | NaN | NaN | 360.0 | 358000.0 | sb | NaN | EXP | 585 | 25-34 | 96.787710 | south | 1 |
| 148662 | Joint | nopre | type1 | p4 | nob/c | 646500 | 3.625 | 7639.80 | 360.0 | 828000.0 | sb | 13500.0 | CIB | 873 | 45-54 | 78.079710 | North | 0 |
| 148663 | Male | nopre | type2 | p1 | b/c | 106500 | NaN | NaN | 360.0 | NaN | sb | 1860.0 | EQUI | 619 | <25 | NaN | North | 1 |
| 148664 | Joint | nopre | type2 | p1 | b/c | 156500 | 3.990 | 3113.06 | 360.0 | 158000.0 | sb | 4020.0 | EXP | 859 | 65-74 | 99.050633 | central | 0 |
| 148665 | Sex Not Available | nopre | type1 | p3 | nob/c | 436500 | 3.125 | 9960.00 | 180.0 | 608000.0 | sb | 7860.0 | CIB | 659 | 55-64 | 71.792763 | south | 0 |
| 148666 | Male | nopre | type1 | p1 | nob/c | 586500 | 5.190 | 0.00 | 360.0 | 788000.0 | sb | 7140.0 | CIB | 569 | 25-34 | 74.428934 | south | 0 |
| 148667 | Male | nopre | type1 | p4 | nob/c | 446500 | 3.125 | 1226.64 | 180.0 | 728000.0 | sb | 6900.0 | CIB | 702 | 45-54 | 61.332418 | North | 0 |
| 148668 | Female | nopre | type1 | p4 | nob/c | 196500 | 3.500 | 4323.33 | 180.0 | 278000.0 | sb | 7140.0 | EXP | 737 | 55-64 | 70.683453 | North | 0 |
| 148669 | Female | nopre | type1 | p3 | nob/c | 406500 | 4.375 | 6000.00 | 240.0 | 558000.0 | sb | 7260.0 | CIB | 830 | 45-54 | 72.849462 | North | 0 |
Most frequently occurring
| gender | approv_in_adv | loan_type | loan_purpose | business_or_commercial | loan_amount | rate_of_interest | upfront_charges | term | property_value | construction_type | income | credit_type | credit_score | age | ltv | region | status | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Male | nopre | type2 | p4 | b/c | 236500 | NaN | NaN | 360.0 | 248000.0 | sb | 3120.0 | CRIF | 673 | 35-44 | 95.362903 | central | 1 | 2 |